Entangled states maximize the two qubit channel capacity for some Pauli channels 

with memory. 
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We prove that a general upper bound on the maximal mutual information of quantum channels 
is saturated in the case of Pauli channels with an arbitrary degree of memory. For a subset of such 
channels we explicitly identify the optimal signal states. We show analytically that for such a class 
of channels entangled states are indeed optimal above a given memory threshold. It is noteworthy 
that the resulting channel capacity is a non-differentiable function of the memory parameter. 
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The study of the optimal coding and decoding of infor- 
mation in quantum systems has a long history Q]. The 
advent of present day quantum information theory 01 
has not only revived interest in the subject but has also 
opened new problems. A key open question concerns the 
additivity of channel capacity when entangled states are 
used as signals. Although entanglement is a ubiquitous 
ingredient in nearly all quantum information processing 
protocols and algorithms, it is often regarded as being 
very fragile in the presence of environmental noise. This 
has led to the belief that in most circumstances the use 
of entanglement is not advantageous in the reliable trans- 
mission of classical information through quantum chan- 
nels. For those memoryless channels (i.e. ones in which 
the noise acting on consecutive uses of the channel is un- 
correlated) that have been studied so far, this is indeed 
the case. This was first proven analytically for the de- 
polarizing channel , where isotropic noise acts on indi- 
vidual qubits, and then extended to a more general form 
of memoryless unital channel f^. There has also been 
interesting recent work demonstrating that the poten- 
tial additivity of channel capacities is equivalent to other 
well known additivity conjectures in quantum informa- 
tion theory The scenario changes when the channel 
is not memoryless, i.e. when the noise acting on con- 
secutive uses is partially correlated. This phenomenon 
is not uncommon in physical situations, when the sta- 
tistical properties of the physical source of noise can be 
time - dependent. The problem of quantum channels 
with memory was first introduced in T|, where, for the 
case of depolarizing channels with memory, it was shown 
that the use of entangled states enhances the mutual in- 
formation. In Ref. l7| input states taken from a certain 
ansatz were considered, and it was shown that within 
this ansatz entangled states allow for the transmission of 
a larger amount of reliable information. However, it was 
not proved analytically that this ansatz is indeed opti- 
mal. Further results bounding the asymptotic capacities 
of noisy channels with memory have also recently been 



derived Q- 

Here for the first time we prove the optimality of a set 
of entangled input signal states for a class of Pauli chan- 
nels. To this end we will first obtain an upper bound 
on the channel capacity. We will then show that for the 
general case of Pauli channels with an arbitrary degree 
of memory this bound is saturated by states of minimal 
output entropy. For a class of Pauli channels we will 
derive these states explicitly. They turn out to be entan- 
gled above a given memory threshold and product states 
below it. 

In order to set the scenario let us first consider a single 
qubit channel that is a random implementation of the 
Pauli transformations: 

3 

p -^"^qiCTipai. (1) 

i=0 

where the qi give a probability distribution, and the ai 
are the Pauli matrices according to the following conven- 
tion: 

^0 = ( 1 ) ; '"1 ^ ( -1 ) 

We will sometimes refer to the Pauli matrices as the Pauli 
group, even though extra phases are required in order to 
make the matrices closed under matrix multiplication. 
However, since these phases cancel out when considering 
transformations of density operators, as 

(e'V,)p(e'V,)^ = (3) 

we will freely make this abuse of terminology. 

In the typical memoryless channel scenario, an under- 
standing of the action of an individual use, such as the 
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one described in Eq.Q, is sufficient to fully describe the 
operation of the channel. However, the possibility of re- 
peated access to the channel opens the question of opti- 
mizing the choice of signal states, including the ones that 
are entangled over many uses of the channel. This leads 
to questions concerning the additivity of channel capaci- 
ties, and whether entangled inputs and output measure- 
ments can lead to improvements in information trans- 
mission. However, in the manner of we would like to 
consider repeated applications of a single qubit channel 
that are not independent. In particular we will consider a 
two qubit channel that is almost equivalent to two inde- 
pendent uses of the single qubit channel aside from 
a memory factor fi G [0,1] that introduces correlations, 
i.e.: 

3 

P ^ ^ PijO-i CTj/do-j (g) aj (4) 

where 

Pi] = (1 - ^J■)q^qj + f^qAj. (5) 

We can see that this evolution can be considered as two 
independent applications of except for an additional 
effect due to the degree of memory /i, which with some 
probability forces the same Pauli transformation to be 
repeated in the second use of the channel. 

We would like to compute the maximum amount of in- 
formation that can be transmitted through a noisy chan- 
nel of the form (|SJ), and investigate how the use of entan- 
gled inputs in the two uses of the channel may improve 
its communication performance. To do this we will show 
that this is equivalent to finding the input pure state with 
minimal output entropy. 

The maximum mutual information of a general quan- 
tum channel £ is given by the Holevo-Schumacher- 
Westmoreland bound 0]: 

X{£) = max Si£{Y.p,p,)) - (6) 

■I % 

where S(lS) = — Tr([jlogw) is the von Neumann entropy 
of the density operator lo and the maximization is per- 
formed over all input ensembles {pi, pi\ into the channel 
(pi are the input states on which classical information 
is encoded, and are transmitted with prior probabilities 
Pi). Note that this bound incorporates a maximization 
over all POVM measurements at the receiver, including 
collective ones over multiple uses of the channel. 

In our scenario the pi describe states of two qubits, 
and so we will refer to the maximum mutual informa- 
tion x(f ) as the two-qubit capacity of the channel. We 
will find it convenient to use the symbol p*{£) to denote 
a chosen input state that gives minimal output entropy 
when transmitted through the channel £. As the maxi- 
mally mixed state gives the largest possible entropy for 



any system, the formula © can clearly be bounded from 
above by 

Xi£) < log2(4) - Sip4£)) = 2 - Sip4£)). (7) 

for any 2-qubit channel. We will now see that this upper 
bound can be achieved by any 2-qubit channel whose ac- 
tion consists of random tensor products of Pauli transfor- 
mations. The argument that we use to demonstrate this 
can be applied to any channel that is covariant with re- 
spect to an irreducible representation of a compact group, 
and has been independently noted by Holevo The 
key ingredients will be the facts that the Pauli matrices 

(a) form an irreducible representation of a group, and 

(b) either commute or anticommute. Indeed, as these 
are essentially the only ingredients required, the same 
argument can easily be modified to multiqubit channels 
whose actions consist of random tensor products of Pauli 
matrices. 

Let us consider an ensemble of input states given by 
the sixteen states defined by pij := ai®ajp^Ui®aj, each 
with the same input probability 1/16. The commutation 
relations of the Pauli matrices imply that any channel 
£ of the form (O is covariant with respect to the Pauli 
rotations 

£{(Ji ® (JjP*(Ji (g) (Tj) = CTi (g) (Tjf (/9*)CTi (g) (Jj (8) 

As entropy is invariant under unitary transformations, 
we can immediately write 

Si£{p.)) = S{£{p.,,)), (9) 

and therefore each of the states pij will also give the same 
minimal output entropy as . Furthermore, the fact that 
the group of matrices {ui ® ctj} is an irreducible repre- 
sentation means that the ensemble will give an average 
output state that is maximally mixed |11| 

^ Yl -^P^i = Yl 3^'^' ® crJS{p)'^^ ® = ^ ■ (10) 

Inserting equations © and IjlOfl into equation © we can 
see that the upper bound Q) is attained by the input 
ensemble of states pij :— Ui ® Ujp^Ui (g Uj with equal 
prior probabilities. This means that to optimise the in- 
formation transmission of our channel, we merely need 
to search for the input state that minimises the output 
entropy. We will refer to any such state as an optimal 
input state. 

In a specific form of memory channel was investi- 
gated, where the weights in equation ((Sj were fixed by 

I ~ X 

qo^ X ■ 91 = 92 = 93 = (11) 

and the degree of memory p was allowed to take any 
value in the interval [0, 1]. An ansatz for the form of the 
optimal input state was conjectured, but a full analytic 
proof is still lacking. 
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Consequently, here we will focus our attention on a 
kind of memory channel for which we can give an entirely 
analytic solution. The form of the channel is character- 
ized by the following parameters in equation |(SJ) 



go = gi = J3 



92 = 93 = g , 



(12) 



where q — {1 ~ 2p)/2. 

In order to identify the optimal input states we will 
first show that we can restrict our attention to input 
states that are invariant under the symmetry group 
{(To (8> ctq, (Ti (E) cTi}. The technique that we will use may 
be generalised to many other channels with a suitable 
structure Let us first consider the following mod- 

ification of the channel £: first rotate the input state 
by (Ti cTi, and then act with £. Let us call this new 
channel £':— £ o (cti ® cti). Using the standard relations 
for the Pauli group: CToCTi = cri, ctiCTi = ao, 0'2(Ti = icra 
and a^ai — —ia2, and the fact that the Pauli matrices 
are hermitian, we can see that preoperating with ai (g) cri 
does not make any difference to the action of this chan- 
nel, and therefore: 



£' = £. 



(13) 



We can also trivially say the same thing if we preoperate 
with the identity operation (Tq ctq. Let us now consider 
the following 'averaging' preoperation: 



(Topao ® ctq + CTi ® CTipcTi ® (Ti) (14) 



From the arguments above follows immediately the 
equality 



£oT^£ 



(15) 



i.e. preoperating on our state with does not affect the 
operation of the above channel. Since by construction 
corresponds to averaging over the group {ctq'^o'o, ai'^ai} 
we need only to consider input states that are invariant 
under it. Let us denote by R the whole set of 2 qubit 
density matrices. We are looking for the explicit form 
of an input state p G R which minimizes the output 
entropy. If we find such an optimal state p*, then by 
the above arguments the input state J-{p*) will also give 
the same output entropy, and will therefore also be opti- 
mal. This means that instead of looking for the optimal 
state in R, we can instead restrict our search to find- 
ing an optimal state from the restricted set J-{R). Since 
the optimal state e ^{R) that minimizes the out- 
put entropy is by construction invariant under the group 
{(To (E) (To ) ci fX" (Ti } it can easily be checked that in the ba- 
sis {|00), |01), |10), |11)} represented by the eigenvectors 
of (Ti ® iTi it must take the form 



a c 

d / 

/* e 

c* 6 



(16) 



From the form of follows that it is a convex combi- 
nation of pure states of the form 



ct|00) -f /3|11) or 
a\0l)+l3\lQ) . 



(17) 



We will now prove that to minimise the output von 
Neumann entropy we can restrict our attention to an 
input pure state of the form IjlTI) . Let us write p.^, in 
terms of its pure state decomposition 



P* = ^pMi){'^i\ ■ 

i 

Then the action of the channel wih give 



(18) 



(19) 



and hence by the concavity of the von Neumann entropy 
entropy 2] we have 



s{£ip.))>j2p^s{£mm)) 



(20) 



In particular suppose without loss of generality that 
is the pure state in the decomposition of that gives 
the lowest output entropy from all the eigenvectors of 
p*. Then the above equation implies that 



Si£{p.))>Si£{\^i){^i\)) 



(21) 



So indeed, as we have assumed that p* is already op- 
timal, this means that this last equation is actually a 
strict equality, and hence one of its eigenvectors will also 
be optimal, namely 



5(5(p.)) = 5(5(|V'i)(^i|)) 



(22) 



Therefore, we can restrict our attention to finding an 
input pure state of the form (|17|l . 

Let us rewrite without loss of generality the input state 
((T7jl as 



iV'e,^) =cos6»|00)-t-e*'^sin6i|ll) 



(23) 



The corresponding state at the output of the channel 
takes the form 



£{\i^e.,4,){i^e.,4,\) 



[(To ® (To + 77 cos 20((To (E) (Tl -I- (Tl (g) (To) 



+Cai (g) (Ti -f /i sin 26 cos (j){a2 (T2 — (T3 (g) era) 

-f p(4p - 1) sin 20 sin (^((T2 (E) (J3 + cts (^2)] , (24) 

where ry = {Ap — 1) and C = p + (1 — p)??^. As we can 
easily verify, the above density operator has the following 
eigenvalues 
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Ai,2 = ^(1-^^) 

A3,4 = + 

±iy^r;2 cos2 261 + fi^ sin^ 26l(cos2 (/) + r;2 sin^ 0) (25) 

As we can infer from the above form of the eigenvalues, 
the input state corresponding to the minimum entropy is 
given by (/) = 0. Moreover, when /i > 77, or equivalently 
p < (/i + l)/4, the input state with minimum entropy is 
the maximahy entangled state (|17l) with 6' = 7r/4. In the 
other case, when fj, < Ap — I, the input state correspond- 
ing to the minimum output entropy is a product state of 
the form |00). 

The set of optimal 16 states discussed above, that max- 
imizes the mutual information along the channel, reduces 
in these cases to a set of four equiprobable input orthog- 
onal states. Therefore, similarly to the case of the de- 
polarizing channel with memory 0, we can identify the 
onset of a threshold value Ht = 4p — 1, above which the 
mutual information along the channel is maximized by 
using equiprobable Bell states. Below the threshold the 
use of entanglement does not bring any benefit since the 



information is optimized by transmitting product states, 
such as the set {|00), |01), |10), |11)}. It is noteworthy 
that the resulting channel capacity is a non-differentiable 
function of the memory parameter /i. 

In conclusion, we have studied the performance of 
Pauli channels with memory effects for the transmission 
of classical information, and we have provided a complete 
proof that a certain class of Pauli channels exhibits the 
onset of a threshold on the degree of memory. We have 
shown that below this threshold the two qubit capacity 
of the channels is achieved by input product states, while 
above it the capacity is achieved by maximally entangled 
input states. This is the first time that entanglement is 
rigorously proven to be a precious resource in the trans- 
mission of classical information in the presence of noise. 
Our results so far have covered a class of Pauli channels, 
characterized by a single noise parameter. However, we 
have numerical evidence that the onset of the thresh- 
old, and the corresponding enhancement of information 
transmission by using entangled states, are features of 
most two-qubit Pauli channels with correlated noise |l3j | . 
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